Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 215094 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 177.6 MiB |
| Average record size in memory | 865.7 B |
Variable types
| CAT | 9 |
|---|---|
| NUM | 8 |
Reproduction
| Analysis started | 2020-03-05 10:30:49.192956 |
|---|---|
| Analysis finished | 2020-03-05 10:41:24.143972 |
| Version | pandas-profiling v2.5.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
회원번호 has a high cardinality: 215094 distinct values | High cardinality |
회원이름 has a high cardinality: 150751 distinct values | High cardinality |
가입일자 has a high cardinality: 3593 distinct values | High cardinality |
최종불입일자 has a high cardinality: 3252 distinct values | High cardinality |
담당자 has a high cardinality: 2157 distinct values | High cardinality |
부서 has a high cardinality: 575 distinct values | High cardinality |
가입일자 only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
최종불입일자 only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
해약금액 has 176137 (81.9%) zeros | Zeros |
연체횟수 has 118228 (55.0%) zeros | Zeros |
| Distinct count | 215094 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 120606.92464224943 |
|---|---|
| Minimum | 0 |
| Maximum | 234612 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12196.65 |
| Q1 | 64479.25 |
| median | 121871.5 |
| Q3 | 178243.75 |
| 95-th percentile | 222708.35 |
| Maximum | 234612 |
| Range | 234612 |
| Interquartile range (IQR) | 113764.5 |
Descriptive statistics
| Standard deviation | 66721.49409 |
|---|---|
| Coefficient of variation (CV) | 0.553214455 |
| Kurtosis | -1.159724153 |
| Mean | 120606.9246 |
| Median Absolute Deviation (MAD) | 57552.93947 |
| Skewness | -0.06307497504 |
| Sum | 2.594182585e+10 |
| Variance | 4451757774 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 11343.5 19639.5 36392.5 53461.5 ... 226652.5 228076.5 228248.5 228347.5 234612. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 213309 | 1 | < 0.1% | |
| 4439 | 1 | < 0.1% | |
| 6486 | 1 | < 0.1% | |
| 341 | 1 | < 0.1% | |
| 2388 | 1 | < 0.1% | |
| 14674 | 1 | < 0.1% | |
| 8529 | 1 | < 0.1% | |
| 10576 | 1 | < 0.1% | |
| 53583 | 1 | < 0.1% | |
| Other values (215084) | 215084 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 234612 | 1 | < 0.1% | |
| 234611 | 1 | < 0.1% | |
| 234610 | 1 | < 0.1% | |
| 234609 | 1 | < 0.1% | |
| 234607 | 1 | < 0.1% |
| Distinct count | 215094 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 3011A03755 | 1 |
|---|---|
| 3011A04136 | 1 |
| 210A004749 | 1 |
| 218A030696 | 1 |
| 217B014992 | 1 |
| Other values (215089) |
| Value | Count | Frequency (%) | |
| 3011A03755 | 1 | < 0.1% | |
| 3011A04136 | 1 | < 0.1% | |
| 210A004749 | 1 | < 0.1% | |
| 218A030696 | 1 | < 0.1% | |
| 217B014992 | 1 | < 0.1% | |
| 218A067289 | 1 | < 0.1% | |
| 1022A05193 | 1 | < 0.1% | |
| 214A005552 | 1 | < 0.1% | |
| 214A003441 | 1 | < 0.1% | |
| 218A032505 | 1 | < 0.1% | |
| Other values (215084) | 215084 | > 99.9% |
Length
| Max length | 11 |
|---|---|
| Mean length | 10.00073456 |
| Min length | 10 |
| Value | Count | Frequency (%) | |
| Uppercase_Letter | 16 | 57.1% | |
| Decimal_Number | 10 | 35.7% | |
| Connector_Punctuation | 1 | 3.6% | |
| Dash_Punctuation | 1 | 3.6% |
| Value | Count | Frequency (%) | |
| Latin | 16 | 57.1% | |
| Common | 12 | 42.9% |
| Value | Count | Frequency (%) | |
| ASCII | 28 | 100.0% |
| Distinct count | 150751 |
|---|---|
| Unique (%) | 70.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 주식회사에프피에이110111 | 70 |
|---|---|
| 임덕길470529 | 24 |
| 대성건설(주)124-81 | 20 |
| 백승식720219 | 12 |
| 강성복551227 | 12 |
| Other values (150746) |
| Value | Count | Frequency (%) | |
| 주식회사에프피에이110111 | 70 | < 0.1% | |
| 임덕길470529 | 24 | < 0.1% | |
| 대성건설(주)124-81 | 20 | < 0.1% | |
| 백승식720219 | 12 | < 0.1% | |
| 강성복551227 | 12 | < 0.1% | |
| (주)지케이씨교역540605 | 10 | < 0.1% | |
| 김숙경550210 | 9 | < 0.1% | |
| 안규덕631212 | 8 | < 0.1% | |
| 조상영551213 | 8 | < 0.1% | |
| 김현영731018 | 8 | < 0.1% | |
| Other values (150741) | 214913 | 99.9% |
Length
| Max length | 25 |
|---|---|
| Mean length | 9.006541326 |
| Min length | 8 |
| Value | Count | Frequency (%) | |
| Other_Letter | 502 | 90.6% | |
| Uppercase_Letter | 26 | 4.7% | |
| Decimal_Number | 10 | 1.8% | |
| Lowercase_Letter | 9 | 1.6% | |
| Other_Punctuation | 3 | 0.5% | |
| Dash_Punctuation | 1 | 0.2% | |
| Open_Punctuation | 1 | 0.2% | |
| Space_Separator | 1 | 0.2% | |
| Close_Punctuation | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| Hangul | 502 | 90.6% | |
| Latin | 35 | 6.3% | |
| Common | 17 | 3.1% |
| Value | Count | Frequency (%) | |
| Hangul | 502 | 90.6% | |
| ASCII | 52 | 9.4% |
주소
Categorical
| Distinct count | 46 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 경기 | |
|---|---|
| 서울 | |
| 인천 | |
| 경상 | 11440 |
| 광주 | 9080 |
| Other values (41) |
| Value | Count | Frequency (%) | |
| 경기 | 59425 | 27.6% | |
| 서울 | 54498 | 25.3% | |
| 인천 | 16745 | 7.8% | |
| 경상 | 11440 | 5.3% | |
| 광주 | 9080 | 4.2% | |
| 부산 | 8535 | 4.0% | |
| 전라 | 8133 | 3.8% | |
| 강원 | 7861 | 3.7% | |
| 충청 | 7778 | 3.6% | |
| 대전 | 6562 | 3.1% | |
| Other values (36) | 25037 | 11.6% |
Length
| Max length | 2 |
|---|---|
| Mean length | 1.999958158 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Other_Letter | 46 | 93.9% | |
| Other_Punctuation | 1 | 2.0% | |
| Space_Separator | 1 | 2.0% | |
| Decimal_Number | 1 | 2.0% |
| Value | Count | Frequency (%) | |
| Hangul | 46 | 93.9% | |
| Common | 3 | 6.1% |
| Value | Count | Frequency (%) | |
| Hangul | 46 | 93.9% | |
| ASCII | 3 | 6.1% |
상태
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 가입 | |
|---|---|
| 해약 | |
| 만기 | 10804 |
| 행사 | 9472 |
| 만기_해약 | 3665 |
| Value | Count | Frequency (%) | |
| 가입 | 106467 | 49.5% | |
| 해약 | 84623 | 39.3% | |
| 만기 | 10804 | 5.0% | |
| 행사 | 9472 | 4.4% | |
| 만기_해약 | 3665 | 1.7% | |
| 해지 | 63 | < 0.1% |
Length
| Max length | 5 |
|---|---|
| Mean length | 2.051117186 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Other_Letter | 9 | 90.0% | |
| Connector_Punctuation | 1 | 10.0% |
| Value | Count | Frequency (%) | |
| Hangul | 9 | 90.0% | |
| Common | 1 | 10.0% |
| Value | Count | Frequency (%) | |
| Hangul | 9 | 90.0% | |
| ASCII | 1 | 10.0% |
| Distinct count | 3593 |
|---|---|
| Unique (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 2018-07-25 | 698 |
|---|---|
| 2018-11-29 | 662 |
| 2014-03-03 | 642 |
| 2018-11-28 | 609 |
| 2018-11-19 | 601 |
| Other values (3588) |
| Value | Count | Frequency (%) | |
| 2018-07-25 | 698 | 0.3% | |
| 2018-11-29 | 662 | 0.3% | |
| 2014-03-03 | 642 | 0.3% | |
| 2018-11-28 | 609 | 0.3% | |
| 2018-11-19 | 601 | 0.3% | |
| 2018-12-17 | 600 | 0.3% | |
| 2018-12-24 | 581 | 0.3% | |
| 2018-11-26 | 579 | 0.3% | |
| 2014-07-28 | 578 | 0.3% | |
| 2018-11-22 | 574 | 0.3% | |
| Other values (3583) | 208970 | 97.2% |
Length
| Max length | 10 |
|---|---|
| Mean length | 10 |
| Min length | 10 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 10 | 90.9% | |
| Dash_Punctuation | 1 | 9.1% |
| Value | Count | Frequency (%) | |
| Common | 11 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 11 | 100.0% |
| Distinct count | 3252 |
|---|---|
| Unique (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 2018-12-26 | |
|---|---|
| 2018-12-17 | 13932 |
| 2018-12-10 | 13718 |
| 2018-12-20 | 13366 |
| 2018-12-05 | 11042 |
| Other values (3247) |
| Value | Count | Frequency (%) | |
| 2018-12-26 | 36888 | 17.1% | |
| 2018-12-17 | 13932 | 6.5% | |
| 2018-12-10 | 13718 | 6.4% | |
| 2018-12-20 | 13366 | 6.2% | |
| 2018-12-05 | 11042 | 5.1% | |
| 2018-12-31 | 2759 | 1.3% | |
| 2018-11-26 | 1239 | 0.6% | |
| 2018-12-12 | 1237 | 0.6% | |
| 2018-12-28 | 881 | 0.4% | |
| 2018-12-14 | 793 | 0.4% | |
| Other values (3242) | 119239 | 55.4% |
Length
| Max length | 10 |
|---|---|
| Mean length | 10 |
| Min length | 10 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 10 | 90.9% | |
| Dash_Punctuation | 1 | 9.1% |
| Value | Count | Frequency (%) | |
| Common | 11 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 11 | 100.0% |
총납입회차
Real number (ℝ≥0)
| Distinct count | 26 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 234.8908849154323 |
|---|---|
| Minimum | 1 |
| Maximum | 390 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 100 |
| Q1 | 120 |
| median | 250 |
| Q3 | 360 |
| 95-th percentile | 390 |
| Maximum | 390 |
| Range | 389 |
| Interquartile range (IQR) | 240 |
Descriptive statistics
| Standard deviation | 117.6439237 |
|---|---|
| Coefficient of variation (CV) | 0.5008449934 |
| Kurtosis | -1.624186994 |
| Mean | 234.8908849 |
| Median Absolute Deviation (MAD) | 107.2700267 |
| Skewness | 0.09744827917 |
| Sum | 50523620 |
| Variance | 13840.09278 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 20. 39.5 55. 62. ... 262. 282. 330. 375. 390. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 100 | 48068 | 22.3% | |
| 390 | 41279 | 19.2% | |
| 360 | 33260 | 15.5% | |
| 260 | 28902 | 13.4% | |
| 130 | 21202 | 9.9% | |
| 250 | 15651 | 7.3% | |
| 140 | 15100 | 7.0% | |
| 120 | 4969 | 2.3% | |
| 60 | 3564 | 1.7% | |
| 160 | 1273 | 0.6% | |
| Other values (16) | 1826 | 0.8% |
| Value | Count | Frequency (%) | |
| 1 | 207 | 0.1% | |
| 39 | 1 | < 0.1% | |
| 40 | 6 | < 0.1% | |
| 44 | 1 | < 0.1% | |
| 48 | 7 | < 0.1% |
| Value | Count | Frequency (%) | |
| 390 | 41279 | 19.2% | |
| 360 | 33260 | 15.5% | |
| 300 | 48 | < 0.1% | |
| 264 | 217 | 0.1% | |
| 260 | 28902 | 13.4% |
최종불입회차
Real number (ℝ≥0)
| Distinct count | 139 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.09372181464848 |
|---|---|
| Minimum | 1 |
| Maximum | 488 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 13 |
| Q3 | 52 |
| 95-th percentile | 100 |
| Maximum | 488 |
| Range | 487 |
| Interquartile range (IQR) | 49 |
Descriptive statistics
| Standard deviation | 39.82158922 |
|---|---|
| Coefficient of variation (CV) | 1.280695487 |
| Kurtosis | 17.96679186 |
| Mean | 31.09372181 |
| Median Absolute Deviation (MAD) | 29.88623994 |
| Skewness | 2.926227284 |
| Sum | 6688073 |
| Variance | 1585.758968 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 4.5 ... 280. 330. 375. 439. 488. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 1 | 34541 | 16.1% | |
| 100 | 19128 | 8.9% | |
| 2 | 15158 | 7.0% | |
| 3 | 10277 | 4.8% | |
| 6 | 8230 | 3.8% | |
| 4 | 7548 | 3.5% | |
| 5 | 6984 | 3.2% | |
| 7 | 5307 | 2.5% | |
| 8 | 4968 | 2.3% | |
| 12 | 4361 | 2.0% | |
| Other values (129) | 98592 | 45.8% |
| Value | Count | Frequency (%) | |
| 1 | 34541 | 16.1% | |
| 2 | 15158 | 7.0% | |
| 3 | 10277 | 4.8% | |
| 4 | 7548 | 3.5% | |
| 5 | 6984 | 3.2% |
| Value | Count | Frequency (%) | |
| 488 | 1 | < 0.1% | |
| 390 | 257 | 0.1% | |
| 360 | 499 | 0.2% | |
| 300 | 2 | < 0.1% | |
| 260 | 116 | 0.1% |
상품금액
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.6972672412991527 |
|---|---|
| Minimum | 0 |
| Maximum | 8 |
| Zeros | 43 |
| Zeros (%) | < 0.1% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.6990751499 |
|---|---|
| Coefficient of variation (CV) | 0.2591790458 |
| Kurtosis | 0.5855036068 |
| Mean | 2.697267241 |
| Median Absolute Deviation (MAD) | 0.5646757043 |
| Skewness | -0.5837855077 |
| Sum | 580166 |
| Variance | 0.4887060652 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 1.5 2.5 3.5 4.5 6.5 8. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 3 | 131303 | 61.0% | |
| 2 | 54414 | 25.3% | |
| 4 | 15839 | 7.4% | |
| 1 | 13358 | 6.2% | |
| 5 | 127 | 0.1% | |
| 0 | 43 | < 0.1% | |
| 8 | 10 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 43 | < 0.1% | |
| 1 | 13358 | 6.2% | |
| 2 | 54414 | 25.3% | |
| 3 | 131303 | 61.0% | |
| 4 | 15839 | 7.4% |
| Value | Count | Frequency (%) | |
| 8 | 10 | < 0.1% | |
| 5 | 127 | 0.1% | |
| 4 | 15839 | 7.4% | |
| 3 | 131303 | 61.0% | |
| 2 | 54414 | 25.3% |
총불입액
Real number (ℝ≥0)
| Distinct count | 935 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 593565.8413298372 |
|---|---|
| Minimum | 750 |
| Maximum | 4880000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 750 |
|---|---|
| 5-th percentile | 10000 |
| Q1 | 45000 |
| median | 240000 |
| Q3 | 600000 |
| 95-th percentile | 2400000 |
| Maximum | 4880000 |
| Range | 4879250 |
| Interquartile range (IQR) | 555000 |
Descriptive statistics
| Standard deviation | 809766.6329 |
|---|---|
| Coefficient of variation (CV) | 1.364240623 |
| Kurtosis | 1.691116703 |
| Mean | 593565.8413 |
| Median Absolute Deviation (MAD) | 607784.8182 |
| Skewness | 1.649109372 |
| Sum | 1.276724511e+11 |
| Variance | 6.557219998e+11 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[7.500e+02 8.750e+02 1.250e+03 5.625e+03 9.125e+03 ... 3.306e+06 3.582e+06 3.750e+06 3.990e+06 4.880e+06], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 10000 | 13853 | 6.4% | |
| 2400000 | 13538 | 6.3% | |
| 30000 | 8544 | 4.0% | |
| 15000 | 7931 | 3.7% | |
| 1980000 | 6289 | 2.9% | |
| 20000 | 5958 | 2.8% | |
| 60000 | 5126 | 2.4% | |
| 90000 | 5048 | 2.3% | |
| 18000 | 4548 | 2.1% | |
| 180000 | 3714 | 1.7% | |
| Other values (925) | 140545 | 65.3% |
| Value | Count | Frequency (%) | |
| 750 | 137 | 0.1% | |
| 1000 | 49 | < 0.1% | |
| 1500 | 6 | < 0.1% | |
| 2000 | 7 | < 0.1% | |
| 2250 | 6 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4880000 | 1 | < 0.1% | |
| 4848000 | 1 | < 0.1% | |
| 4800000 | 3 | < 0.1% | |
| 4640000 | 1 | < 0.1% | |
| 4500000 | 13 | < 0.1% |
| Distinct count | 1824 |
|---|---|
| Unique (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98528.11243921262 |
|---|---|
| Minimum | 0 |
| Maximum | 3876000 |
| Zeros | 176137 |
| Zeros (%) | 81.9% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 858350 |
| Maximum | 3876000 |
| Range | 3876000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 353750.1703 |
|---|---|
| Coefficient of variation (CV) | 3.590347583 |
| Kurtosis | 16.95820195 |
| Mean | 98528.11244 |
| Median Absolute Deviation (MAD) | 174273.7362 |
| Skewness | 4.112052435 |
| Sum | 2.119280582e+10 |
| Variance | 1.25139183e+11 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0000000e+00 3.1500000e+02 6.9000000e+02 8.0000000e+02 1.1350000e+03 ... 2.6287250e+06 2.6507100e+06 2.6507225e+06 2.7045000e+06 3.8760000e+06], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0 | 176137 | 81.9% | |
| 10000 | 7801 | 3.6% | |
| 15000 | 2470 | 1.1% | |
| 1944000 | 1955 | 0.9% | |
| 18000 | 1538 | 0.7% | |
| 20000 | 1431 | 0.7% | |
| 1603000 | 1119 | 0.5% | |
| 24000 | 1099 | 0.5% | |
| 28000 | 481 | 0.2% | |
| 30000 | 372 | 0.2% | |
| Other values (1814) | 20691 | 9.6% |
| Value | Count | Frequency (%) | |
| 0 | 176137 | 81.9% | |
| 630 | 17 | < 0.1% | |
| 750 | 120 | 0.1% | |
| 850 | 16 | < 0.1% | |
| 1000 | 33 | < 0.1% |
| Value | Count | Frequency (%) | |
| 3876000 | 2 | < 0.1% | |
| 3664000 | 1 | < 0.1% | |
| 3600000 | 1 | < 0.1% | |
| 3163310 | 1 | < 0.1% | |
| 3149250 | 1 | < 0.1% |
| Distinct count | 2157 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 더피플라이프 | |
|---|---|
| 금강종합상조(주) | |
| 강대석 | 6336 |
| 김영권 | 4857 |
| 이덕술 | 4749 |
| Other values (2152) |
| Value | Count | Frequency (%) | |
| 더피플라이프 | 72013 | 33.5% | |
| 금강종합상조(주) | 28707 | 13.3% | |
| 강대석 | 6336 | 2.9% | |
| 김영권 | 4857 | 2.3% | |
| 이덕술 | 4749 | 2.2% | |
| 김영경 | 4241 | 2.0% | |
| 제이앤지 | 1848 | 0.9% | |
| 심상열 | 1818 | 0.8% | |
| 고달진 | 1703 | 0.8% | |
| 안미나 | 1489 | 0.7% | |
| Other values (2147) | 87333 | 40.6% |
Length
| Max length | 11 |
|---|---|
| Mean length | 4.831027365 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Other_Letter | 252 | 96.9% | |
| Uppercase_Letter | 4 | 1.5% | |
| Decimal_Number | 2 | 0.8% | |
| Open_Punctuation | 1 | 0.4% | |
| Close_Punctuation | 1 | 0.4% |
| Value | Count | Frequency (%) | |
| Hangul | 252 | 96.9% | |
| Common | 4 | 1.5% | |
| Latin | 4 | 1.5% |
| Value | Count | Frequency (%) | |
| Hangul | 252 | 96.9% | |
| ASCII | 8 | 3.1% |
| Distinct count | 575 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 차용갑 | 24354 |
|---|---|
| 김선 | 22611 |
| 관리부 | 20067 |
| 직영팀 | 15348 |
| 특판영업 | 10735 |
| Other values (570) |
| Value | Count | Frequency (%) | |
| 차용갑 | 24354 | 11.3% | |
| 김선 | 22611 | 10.5% | |
| 관리부 | 20067 | 9.3% | |
| 직영팀 | 15348 | 7.1% | |
| 특판영업 | 10735 | 5.0% | |
| 강대석 | 7831 | 3.6% | |
| CJ오쇼핑 | 7183 | 3.3% | |
| 이진희 | 6187 | 2.9% | |
| 이덕술 | 6074 | 2.8% | |
| 이충호(SM라이프) | 4636 | 2.2% | |
| Other values (565) | 90068 | 41.9% |
Length
| Max length | 12 |
|---|---|
| Mean length | 4.397765628 |
| Min length | 2 |
| Value | Count | Frequency (%) | |
| Other_Letter | 275 | 92.3% | |
| Decimal_Number | 10 | 3.4% | |
| Uppercase_Letter | 9 | 3.0% | |
| Open_Punctuation | 1 | 0.3% | |
| Space_Separator | 1 | 0.3% | |
| Close_Punctuation | 1 | 0.3% | |
| Other_Punctuation | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| Hangul | 275 | 92.3% | |
| Common | 14 | 4.7% | |
| Latin | 9 | 3.0% |
| Value | Count | Frequency (%) | |
| Hangul | 275 | 92.3% | |
| ASCII | 23 | 7.7% |
| Distinct count | 352 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.40128037044269 |
|---|---|
| Minimum | -447 |
| Maximum | 119 |
| Zeros | 118228 |
| Zeros (%) | 55.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | -447 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 34 |
| 95-th percentile | 92 |
| Maximum | 119 |
| Range | 566 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 37.12385834 |
|---|---|
| Coefficient of variation (CV) | 2.133398092 |
| Kurtosis | 24.98001465 |
| Mean | 17.40128037 |
| Median Absolute Deviation (MAD) | 25.41706959 |
| Skewness | -2.079096372 |
| Sum | 3742911 |
| Variance | 1378.180858 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-447. -385.5 -361.5 -342.5 -339.5 ... 99.5 109.5 116.5 118.5 119. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0 | 118228 | 55.0% | |
| 1 | 4983 | 2.3% | |
| 99 | 3586 | 1.7% | |
| 2 | 2372 | 1.1% | |
| 5 | 1885 | 0.9% | |
| 57 | 1862 | 0.9% | |
| 3 | 1758 | 0.8% | |
| 4 | 1688 | 0.8% | |
| 50 | 1535 | 0.7% | |
| 53 | 1526 | 0.7% | |
| Other values (342) | 75671 | 35.2% |
| Value | Count | Frequency (%) | |
| -447 | 1 | < 0.1% | |
| -386 | 1 | < 0.1% | |
| -385 | 2 | < 0.1% | |
| -384 | 2 | < 0.1% | |
| -381 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 119 | 149 | 0.1% | |
| 118 | 83 | < 0.1% | |
| 117 | 60 | < 0.1% | |
| 116 | 52 | < 0.1% | |
| 115 | 42 | < 0.1% |
성별
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 여 | |
|---|---|
| 남 | |
| 기타 | 23 |
| Value | Count | Frequency (%) | |
| 여 | 113400 | 52.7% | |
| 남 | 101671 | 47.3% | |
| 기타 | 23 | < 0.1% |
Length
| Max length | 2 |
|---|---|
| Mean length | 1.00010693 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Other_Letter | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| Hangul | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| Hangul | 4 | 100.0% |
나이
Real number (ℝ≥0)
| Distinct count | 91 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55.55914623373967 |
|---|---|
| Minimum | 21 |
| Maximum | 120 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 21 |
|---|---|
| 5-th percentile | 31 |
| Q1 | 46 |
| median | 56 |
| Q3 | 65 |
| 95-th percentile | 79 |
| Maximum | 120 |
| Range | 99 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 14.22322881 |
|---|---|
| Coefficient of variation (CV) | 0.2560015726 |
| Kurtosis | -0.4298260064 |
| Mean | 55.55914623 |
| Median Absolute Deviation (MAD) | 11.55502349 |
| Skewness | 0.007568506106 |
| Sum | 11950439 |
| Variance | 202.3002378 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 21. 22.5 23.5 24.5 25.5 ... 108.5 109.5 112. 119.5 120. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 60 | 6518 | 3.0% | |
| 59 | 6261 | 2.9% | |
| 58 | 6111 | 2.8% | |
| 52 | 6033 | 2.8% | |
| 55 | 5876 | 2.7% | |
| 56 | 5852 | 2.7% | |
| 61 | 5840 | 2.7% | |
| 53 | 5569 | 2.6% | |
| 62 | 5525 | 2.6% | |
| 54 | 5511 | 2.6% | |
| Other values (81) | 155998 | 72.5% |
| Value | Count | Frequency (%) | |
| 21 | 112 | 0.1% | |
| 22 | 182 | 0.1% | |
| 23 | 313 | 0.1% | |
| 24 | 483 | 0.2% | |
| 25 | 671 | 0.3% |
| Value | Count | Frequency (%) | |
| 120 | 24 | < 0.1% | |
| 119 | 3 | < 0.1% | |
| 118 | 1 | < 0.1% | |
| 117 | 1 | < 0.1% | |
| 114 | 2 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
| df_index | 회원번호 | 회원이름 | 주소 | 상태 | 가입일자 | 최종불입일자 | 총납입회차 | 최종불입회차 | 상품금액 | 총불입액 | 해약금액 | 담당자 | 부서 | 연체횟수 | 성별 | 나이 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 0022A00001 | 이옥성590318 | 경기 | 만기_해약 | 2008-09-08 | 2016-12-20 | 100 | 100 | 2 | 2400000 | 1944000 | 더피플라이프 | 관리부 | 0 | 남 | 61 |
| 1 | 1 | 0072A00001 | 안성열581125 | 경기 | 행사 | 2007-09-28 | 2012-10-04 | 100 | 100 | 2 | 2400000 | 0 | 더피플라이프 | 관리부 | 0 | 남 | 62 |
| 2 | 2 | 0072A00002 | 배준택831121 | 부산 | 가입 | 2007-10-23 | 2013-05-20 | 100 | 68 | 2 | 1632000 | 0 | 더피플라이프 | 관리부 | 32 | 남 | 37 |
| 3 | 3 | 0072A00003 | 배민규821023 | 울산 | 행사 | 2007-10-23 | 2014-12-31 | 100 | 100 | 2 | 2400000 | 0 | 더피플라이프 | 관리부 | 0 | 남 | 38 |
| 4 | 4 | 0072A00006 | 최금순340728 | 경기 | 행사 | 2007-10-31 | 2016-01-05 | 100 | 100 | 2 | 2400000 | 0 | 더피플라이프 | 관리부 | 0 | 여 | 86 |
| 5 | 5 | 0072A00007 | 주병오520206 | 서울 | 해약 | 2007-11-19 | 2013-08-05 | 100 | 70 | 2 | 1680000 | 1212000 | 더피플라이프 | 관리부 | 30 | 남 | 68 |
| 6 | 6 | 0072A00021 | 정성제760210 | 서울 | 만기 | 2008-03-31 | 2016-06-20 | 100 | 100 | 2 | 2400000 | 0 | 더피플라이프 | 관리부 | 0 | 남 | 44 |
| 7 | 7 | 0072A00022 | 신영주541016 | 충청 | 만기 | 2008-04-14 | 2018-03-20 | 100 | 100 | 2 | 2400000 | 0 | 더피플라이프 | 관리부 | 0 | 여 | 66 |
| 8 | 8 | 0072A00026 | 윤일선521001 | 서울 | 해약 | 2008-09-23 | 2016-08-25 | 100 | 96 | 2 | 2304000 | 1866000 | 더피플라이프 | 관리부 | 4 | 여 | 68 |
| 9 | 9 | 0072A00027 | 김건용530815 | 서울 | 만기 | 2008-09-23 | 2016-12-30 | 100 | 100 | 2 | 2400000 | 0 | 더피플라이프 | 관리부 | 0 | 여 | 67 |
Last rows
| df_index | 회원번호 | 회원이름 | 주소 | 상태 | 가입일자 | 최종불입일자 | 총납입회차 | 최종불입회차 | 상품금액 | 총불입액 | 해약금액 | 담당자 | 부서 | 연체횟수 | 성별 | 나이 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 215084 | 234600 | U022A21089 | 백쌍순320814 | 부산 | 해약 | 2008-12-19 | 2013-09-23 | 100 | 58 | 2 | 1392000 | 912000 | 더피플라이프 | 관리부 | 42 | 여 | 88 |
| 215085 | 234601 | U022A21090 | 손희락520228 | 부산 | 해약 | 2008-12-19 | 2015-10-20 | 100 | 83 | 2 | 1992000 | 1613000 | 더피플라이프 | 관리부 | 17 | 남 | 68 |
| 215086 | 234602 | U022A21305 | 길준분660312 | 부산 | 해약 | 2008-12-22 | 2013-10-14 | 100 | 54 | 2 | 1296000 | 828000 | 더피플라이프 | 관리부 | 46 | 여 | 54 |
| 215087 | 234605 | U022A21379 | 김관수370127 | 경남 | 해약 | 2008-12-23 | 2010-07-26 | 100 | 20 | 2 | 480000 | 0 | 더피플라이프 | 관리부 | 80 | 남 | 83 |
| 215088 | 234606 | U022A22154 | 김순옥561026 | 부산 | 해약 | 2009-01-07 | 2009-03-20 | 100 | 3 | 2 | 72000 | 0 | 더피플라이프 | 관리부 | 97 | 여 | 64 |
| 215089 | 234607 | U022A22155 | 조길찬761012 | 부산 | 해약 | 2009-01-07 | 2009-04-27 | 100 | 3 | 2 | 72000 | 0 | 더피플라이프 | 관리부 | 97 | 남 | 44 |
| 215090 | 234609 | U244A00803 | 배상호820807 | 부산 | 해약 | 2008-12-16 | 2012-06-29 | 60 | 43 | 2 | 1720000 | 1240000 | 더피플라이프 | 관리부 | 17 | 남 | 38 |
| 215091 | 234610 | U244A00804 | 배규태830402 | 부산 | 만기 | 2008-12-16 | 2013-11-25 | 60 | 60 | 2 | 2400000 | 0 | 더피플라이프 | 관리부 | 0 | 남 | 37 |
| 215092 | 234611 | U244A00805 | 김군자490809 | 부산 | 해약 | 2008-12-16 | 2008-12-18 | 60 | 1 | 2 | 40000 | 0 | 더피플라이프 | 관리부 | 59 | 여 | 71 |
| 215093 | 234612 | U244A00806 | 정일선540801 | 부산 | 해약 | 2008-12-16 | 2009-11-25 | 60 | 12 | 2 | 480000 | 0 | 더피플라이프 | 관리부 | 48 | 여 | 66 |